Annotation of Sentence Structure; Capturing the Relationship among Clauses in Czech Sentences

نویسندگان

  • Markéta Lopatková
  • Natalia Klyueva
  • Petr Homola
چکیده

The goal of the presented project is to assign a structure of clauses to Czech sentences from the Prague Dependency Treebank (PDT) as a new layer of syntactic annotation, a layer of clause structure. The annotation is based on the concept of segments, linguistically motivated and easily automatically detectable units. The task of the annotators is to identify relations among segments, especially relations of super/subordination, coordination, apposition and parenthesis. Then they identify individual clauses forming complex sentences. In the pilot phase of the annotation, 2,699 sentences from PDT were annotated with respect to their sentence structure.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Annotation of sentence structure - Capturing the relationship between clauses in Czech sentences

The focus of this article is on the creation of a collection of sentences manually annotated with respect to their sentence structure. We show that the concept of linear segments—linguistically motivated units, which may be easily detected automatically—serves as a good basis for the identification of clauses in Czech. The segment annotation captures such relationships as subordination, coordin...

متن کامل

Segmentation of Complex Sentences

The paper describes a method of dividing complex sentences into segments, easily detectable and linguistically motivated units that may be subsequently combined into clauses and thus provide a structure of a complex sentence with regard to the mutual relationship of individual clauses. The method has been developed for Czech as a language representing languages with relatively high degree of wo...

متن کامل

Obtaining Hidden Relations from a Syntactically Annotated Corpus - From Word Relationships to Clause Relationships

The paper concentrates on obtaining hidden relationships among individual clauses of complex sentences from the Prague Dependency Treebank. The treebank contains only an information about mutual relationships among individual tokens (words, punctuation marks), not about more complex units (clauses). For the experiments with clauses and their parts (segments) it was therefore necessary to develo...

متن کامل

A Structural Organization of Modern English Multiple Complex-Compound

The article focuses on the factors that cause linear and vertical sentence extension of multiple complex-compound sentences used in English fictional literature. Considering the sentence structure as a combination of 2 Units – paratactic and hypotactic the authors define the structural peculiarities of paratactic and hypotactic units including the number of clauses and its bonds. The extension ...

متن کامل

Comprehension of Complex Sentences in the Persian-Speaking Patients With Aphasia

Introduction: To study sentence comprehension in Persian-speaking Patients with Aphasia considering the factors of complexity. Methods: In this cross-sectional study, the performance of 6 non-fluent aphasic patients were tested and their performance was compared to 15 matched control group. Comprehension of semantically reversible sentences was assessed using a binary sentence-picture matching...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009